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Internet Mail Architecture 


Abstract 


Over its thirty-five-year history, Internet Mail has changed significantly in scale and complexity, 
as it has become a global infrastructure service. These changes have been evolutionary, rather than 
revolutionary, reflecting a strong desire to preserve both its installed base and its usefulness. To 
collaborate productively on this large and complex system, all participants need to work from a 
common view of it and use a common language to describe its components and the interactions among 
them. But the many differences in perspective currently make it difficult to know exactly what another 
participant means. To serve as the necessary common frame of reference, this document describes the 
enhanced Internet Mail architecture, reflecting the current service. 
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1.1. 


Introduction 


Over its thirty-five-year history, Internet Mail has changed significantly in scale and complexity, 
as it has become a global infrastructure service. These changes have been evolutionary, rather 
than revolutionary, reflecting a strong desire to preserve both its installed base and its usefulness. 
Today, Internet Mail is distinguished by many independent operators, many different components for 
providing service to Users, as well as many different components that transfer messages. 


The underlying technical standards for Internet Mail comprise a rich array of functional capabilities. 
These specifications form the core: 


e Simple Mail Transfer Protocol (SMTP) ([RFC0821], [RFC2821], [RFC5321]) moves a 
message through the Internet. 


* Internet Mail Format (IMF) ([RFC0733], [RFC0822], [RFC2822], [RFC5322]) defines a 
message object. 


e Multipurpose Internet Mail Extensions (MIME) [RFC2045] defines an enhancement to the 
message object that permits using multimedia attachments. 


Public collaboration on technical, operations, and policy activities of email, including those that 
respond to the challenges of email abuse, has brought a much wider range of participants into the 
technical community. To collaborate productively on this large and complex system, all participants 
need to work from a common view of it and use a common language to describe its components and the 
interactions among them. But the many differences in perspective currently make it difficult to know 
exactly what another participant means. 


It is the need to resolve these differences that motivates this document, which describes the realities of 
the current system. Internet Mail is the subject of ongoing technical, operations, and policy work, and 
the discussions often are hindered by different models of email-service design and different meanings 
for the same terms. 


To serve as the necessary common frame of reference, this document describes the enhanced Internet 
Mail architecture, reflecting the current service. The document focuses on: 

e Capturing refinements to the email model 

e Clarifying functional roles for the architectural components 

e Clarifying identity-related issues, across the email service 


e Defining terminology for architectural components and their interactions 


History 


The first standardized architecture for networked email specified a simple split between the user world, 
in the form of Message User Agents (MUAs), and the transfer world, in the form of the Message 
Handling Service (MHS), which is composed of Message Transfer Agents (MTAs) [RFC1506]. The 
MHS accepts a message from one User and delivers it to one or more other Users, creating a virtual 
MUA-to-MUA exchange environment. 


As shown in Figure 1, this architecture defines two logical layers of interoperability. One is directly 
between Users. The other is among the components along the transfer path. In addition, there is 
interoperability between the layers, first when a message is posted from the User to the MHS and later 
when it is delivered from the MHS to the User. 


The operational service has evolved, although core aspects of the service, such as mailbox addressing 
and message format style, remain remarkably constant. The original distinction between the user level 
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and transfer level remains, but with elaborations in each. The term "Internet Mail" is used to refer to 
the entire collection of user and transfer components and services. 


For Internet Mail, the term "end-to-end" usually refers to a single posting and the set of deliveries 
that result from a single transit of the MHS. А common exception is group dialogue that is mediated 
through a Mailing List; in this case, two postings occur before intended Recipients receive an Author's 
message, as discussed in Section 2.1.4. In fact, some uses of email consider the entire email service, 
including Author and Recipient, as a subordinate component. For these services, "end-to-end" refers 
to points outside the email service. Examples are voicemail over email [RFC3801], EDI (Electronic 
Data Interchange) over email [RFC1767], and facsimile over email [RFC4142]. 


User 
^ 
| 
| 
| 
| 


Figure 1: Basic Internet Mail Service Model 


End-to-end Internet Mail exchange is accomplished by using a standardized infrastructure with these 
components and characteristics: 

e Anemail object 

e Global addressing 

“ Ап asynchronous sequence of point-to-point transfer mechanisms 

e No requirement for prior arrangement between MTAs or between Authors and Recipients 

e No requirement for prior arrangement between point-to-point transfer services over the open 

Internet 


e No requirement for Author, Originator, or Recipients to be online at the same time 


The end-to-end portion of the service is the email object, called a "message". Broadly, the message 
itself distinguishes control information, for handling, from the Author's content. 


A precept to the design of mail over the open Internet is permitting User-to-User and MTA-to-MTA 
interoperability without prior, direct arrangement between the independent administrative authorities 
responsible for handling a message. АП participants rely on having the core services universally 
supported and accessible, either directly or through Gateways that act as translators between Internet 
Mail and email environments conforming to other standards. Given the importance of spontaneity and 
serendipity in interpersonal communications, not requiring such prearrangement between participants 
is a core benefit of Internet Mail and remains a core requirement for it. 


Within localized networks at the edge of the public Internet, prior administrative arrangement often 
is required and can include access control, routing constraints, and configuration of the information 
query service. Although Recipient authentication has usually been required for message access since 
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the beginning of Internet Mail, in recent years it also has been required for message submission. In 
these cases, a server validates the client's identity, whether by explicit security protocols or by implicit 
infrastructure queries to identify "local" participants. 


1.2. The Role of This Architecture 


An Internet service is an integration of related capabilities among two or more participating nodes. The 
capabilities are accomplished across the Internet by one or more protocols. What connects a protocol 
to a service is an architecture. An architecture specifies how the protocols implement the service by 
defining the logical components of a service and the relationships among them. From that logical view, 
a service defines what 15 being done, an architecture defines where the pieces are (in relation to each 
other), and a protocol defines how particular capabilities are performed. 


As such, an architecture will more formally describe a service at a relatively high level. A protocol 
that implements some portion of a service will conform to the architecture to a greater or lesser extent, 
depending on the pragmatic tradeoffs they make when trying to implement the architecture in the 
context of real-world constraints. Failure to precisely follow an architecture is not a failure of the 
protocol, nor is failure to precisely cast a protocol a failure of the architecture. Where a protocol varies 
from the architecture, it will of course be appropriate for it to explain the reason for the variance. 
However, such variance is not a mark against a protocol: Happily, the IETF prefers running code to 
architectural purity. 


In this particular case, this architecture attempts to define the logical components of Internet email and 
does so post hoc, trying to capture the architectural principles that the current email protocols embody. 
To different extents, email protocols will conform to this architecture more or less well. Insofar as this 
architecture differs from those protocols, the reasons are generally well understood and are required for 
interoperation. The differences are not a sign that protocols need to be fixed. However, this architecture 
is a best attempt at a logical model of Internet email, and insofar as new protocol development varies 
from this architecture, it is necessary for designers to understand those differences and explain them 
carefully. 


1.3. Document Conventions 


References to structured fields of a message use a two-part dotted notation. The first part cites 
the document that contains the specification for the field, and the second part is the name of the 
field. Hence <RFC5322.From> is the IMF From: header field in an email content header, апа 
<RFC5321.MailFrom> is the address in the SMTP "Mail From" command. 


When occurring without the IMF (RFC 5322) qualifier, header field names are shown with a colon 
suffix. For example, From:. 


References to labels for actors, functions or components have the first letter capitalized. 
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2.1. 


Responsible Actor Roles 


Internet Mail is a highly distributed service, with a variety of Actors playing different roles. These 
Actors fall into three basic types: 


e User 
e Message Handling Service (MHS) 
e ADministrative Management Domain (ADMD) 
Although related to a technical architecture, the focus on Actors concerns participant responsibilities, 


rather than functionality of modules. For that reason, the labels used are different from those used in 
classic diagrams of email architecture. 


User Actors 


Users are the sources and sinks of messages. Users can be people, organizations, or processes. They 
can have an exchange that iterates, and they can expand or contract the set of Users that participate in 
a set of exchanges. In Internet Mail, there are four types of Users: 


e Authors 

e Recipients 

e Return Handlers 
e Mediators 


Figure 2 shows the primary and secondary flows of messages among them. As a pragmatic heuristic: 
User Actors can generate, modify, or look at the whole message. 


< < 


757707777774 


v 


| 
| 
| 
| 
| 
| 
| 
| 
I 


Figure 2: Relationships among User Actors 


From a User's perspective, all message-transfer activities are performed by a monolithic Message 
Handling Service (MHS), even though the actual service can be provided by many independent 
organizations. Users are customers of this unified service. 


Whenever any MHS Actor sends information back to an Author or Originator in the sequence of 
handling a message, that Actor is a User. 
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2.1.1. Author 


The Author is responsible for creating the message, its contents, and its list of Recipient addresses. 
The MHS transfers the message from the Author and delivers it to the Recipients. The MHS has an 
Originator role (Section 2.2.1) that correlates with the Author role. 


2.1.2. Recipient 


The Recipient is a consumer of the delivered message. The MHS has a Receiver role (Section 2.2.4) 
that correlates with the Recipient role. This is labeled Recv in Figure 3. 


Any Recipient can close the user-communication loop by creating and submitting a new message 
that replies to the Author. An example of an automated form of reply is the Message Disposition 
Notification (MDN), which informs the Author about the Recipient's handling of the message. (See 
Section 4.1.) 


2.1.3. Return Handler 


Also called "Bounce Handler", the Return Handler is a special form of Recipient tasked with servicing 
notifications generated by the MHS as it transfers or delivers the message. (See Figure 3.) These notices 
can be about failures or completions and are sent to an address that is specified by the Originator. This 
Return Handling address (also known as a Return Address) might have no visible characteristics in 
common with the address of the Author or Originator. 


2.1.4. Mediator 


A Mediator receives, aggregates, reformulates, and redistributes messages among Authors and 
Recipients who are the principals in (potentially) protracted exchanges. This activity is easily confused 
with the underlying MHS transfer exchanges. However, each serves very different purposes and 
operates in very different ways. 


When mail is delivered to the Mediator specified in the RFC5321.RcptTo command for the original 
message, the MHS handles it the same way as for any other Recipient. In particular, the MHS sees each 
posting and delivery activity between sources and sinks as independent; it does not see subsequent 
re-posting as a continuation of a process. Because the Mediator originates messages, it can receive 
replies. Hence, when submitting a reformulated message, the Mediator is an Author, albeit an Author 
actually serving as an agent of one or more other Authors. So a Mediator really is a full-fledged User. 
Mediators are considered extensively in Section 5. 


A Mediator attempts to preserve the original Author's information in the message it reformulates but 
is permitted to make meaningful changes to the message content or envelope. The MHS sees а new 
message, but Users receive a message that they interpret as being from, or at least initiated by, the 
Author of the original message. The role of a Mediator is not limited to merely connecting other 
participants; the Mediator is responsible for the new message. 


A Mediator's role is complex and contingent, for example, modifying and adding content or regulating 
which Users are allowed to participate and when. The common example of this role is a group Mailing 
List. In a more complex use, a sequence of Mediators could perform a sequence of formal steps, such 
as reviewing, modifying, and approving a purchase request. 


A Gateway is a particularly interesting form of Mediator. It is a hybrid of User and Relay that 
connects heterogeneous mail services. Its purpose is to emulate a Relay. For a detailed discussion, see 
Section 2.2.3. 
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2.2. Message Handling Service (MHS) Actors 


The Message Handling Service (MHS) performs a single end-to-end transfer on behalf of the Author 
to reach the Recipient addresses specified in the original RFC5321.RcptTo commands. Exchanges 
that are either mediated or iterative and protracted, such as those used for collaboration over time, are 
handled by the User Actors, not by the MHS Actors. As a pragmatic heuristic MHS Actors generate, 
modify, or look at only transfer data, rather than the entire message. 


Figure 3 shows the relationships among transfer participants in Internet Mail. Although it shows the 
Originator (labeled Origin) as distinct from the Author, and Receiver (labeled Recv) as distinct from 
Recipient, each pair of roles usually has the same Actor. Transfers typically entail one or more Relays. 
However, direct delivery from the Originator to Receiver is possible. Intra-organization mail services 
usually have only one Relay. 


Figure 3: Relationships among MHS Actors 


2.2.1. Originator 


The Originator ensures that a message is valid for posting and then submits it to a Relay. A message 
is valid if it conforms to both Internet Mail standards and local operational policies. The Originator 
can simply review the message for conformance and reject it if it finds errors, or it can create some or 
all of the necessary information. In effect, the Originator is responsible for the functions of the Mail 
Submission Agent. 


The Originator operates with dual allegiance. It serves the Author and can be the same entity. But its 
role in assuring validity means that it also represents the local operator of the MHS, that is, the local 
ADministrative Management Domain (ADMD). 


The Originator also performs any post-submission, Author-related administrative tasks associated with 
message transfer and delivery. Notably, these tasks pertain to sending error and delivery notices, 
enforcing local policies, and dealing with messages from the Author that prove to be problematic for 
the Internet. The Originator is accountable for the message content, even when it is not responsible for 
it. The Author creates the message, but the Originator handles any transmission issues with it. 
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2.2.2. Relay 


The Relay performs MHS-level transfer-service routing and store-and-forward by transmitting or 
retransmitting the message to its Recipients. The Relay adds trace information [RFC2505] but does 
not modify the envelope information or the message content semantics. It can modify message content 
representation, such as changing the form of transfer encoding from binary to text, but only as required 
to meet the capabilities of the next hop in the MHS. 


A Message Handling System (MHS) network consists of a set of Relays. This network is above any 
underlying packet-switching network that might be used and below any Gateways or other Mediators. 


In other words, email scenarios can involve three distinct architectural layers, each providing its own 
type of data of store-and-forward service: 


e User Mediators 
e MHS Relays 
e Packet Switches 


The bottom layer is the Internet's IP service. The most basic email scenarios involve Relays and 
Switches. 


When a Relay stops attempting to transfer a message, it becomes an Author because it sends an error 
message to the Return Address. The potential for looping is avoided by omitting a Return Address 
from this message. 


2.2.3. Gateway 


A Gateway is a hybrid of User and Relay that connects heterogeneous mail services. Its purpose is to 
emulate a Relay and the closer it comes to this, the better. A Gateway operates as a User when it needs 
the ability to modify message content. 


Differences between mail services can be as small as minor syntax variations, but they usually 
encompass significant, semantic distinctions. One difference could be email addresses that are 
hierarchical and machine-specific rather than a flat, global namespace. Another difference could 
be support for text-only content or multimedia. Hence the Relay function in a Gateway presents a 
significant design challenge if the resulting performance is to be seen as nearly seamless. The challenge 
is to ensure User-to-User functionality between the services, despite differences in their syntax and 
semantics. 


The basic test of Gateway design is whether an Author on one side of a Gateway can send a useful 
message to a Recipient on the other side, without requiring changes to any components in the Author's 
or Recipient's mail services other than adding the Gateway. To each of these otherwise independent 
services, the Gateway appears to be a native participant. But the ultimate test of Gateway design 
is whether the Author and Recipient can sustain a dialogue. In particular, can a Recipients MUA 
automatically formulate a valid Reply that will reach the Author? 


2.2.4. Receiver 


The Receiver performs final delivery or sends the message to an alternate address. It can also perform 
filtering and other policy enforcement immediately before or after delivery. 
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2.3. Administrative Actors 


Administrative Actors can be associated with different organizations, each with its own administrative 
authority. This operational independence, coupled with the need for interaction between groups, 
provides the motivation to distinguish among ADministrative Management Domains (ADMDs). Each 
ADMD can have vastly different operating policies and trust-based decision-making. One obvious 
example is the distinction between mail that is exchanged within an organization and mail that is 
exchanged between independent organizations. The rules for handling both types of traffic tend to be 
quite different. That difference requires defining the boundaries of each, and this requires the ADMD 
construct. 


Operation of Internet Mail services is carried out by different providers (or operators). Each can be 
an independent ADMD. This independence of administrative decision-making defines boundaries that 
distinguish different portions of the Internet Mail service. A department that operates a local Relay, an 
IT department that operates an enterprise Relay, and an ISP that operates a public shared email service 
can be configured into many combinations of administrative and operational relationships. Each is a 
distinct ADMD, potentially having a complex arrangement of functional components. Figure 4 depicts 
relationships among ADMDs. The benefit of the ADMD construct is that it facilitates discussion about 
designs, policies, and operations that need to distinguish between internal issues and external ones. 


The architectural impact of the need for boundaries between ADMDs is discussed in [Tussle]. 
Most significant is that the entities communicating across ADMD boundaries typically have the 
added burden of enforcing organizational policies concerning external communications. At a more 
mundane level, routing mail between ADMDs сап be an issue, such as needing to route mail between 
organizational partners over specially trusted paths. 


These are three basic types of ADMDs: 
Edge: Independent transfer services in networks at the edge of the open Internet Mail 
service. 


Consumer: | Might be a type of Edge service, as is common for web-based email access. 


Transit: Mail Service Providers (MSPs) that offer value-added capabilities for Edge 
ADMDs, such as aggregation and filtering. 


The mail-level transit service is different from packet-level switching. End-to-end packet transfers 
usually go through intermediate routers; email exchange across the open Internet can be directly 
between the Boundary MTAs of Edge ADMDs. This distinction between direct and indirect interaction 
highlights the differences discussed in Section 2.2.2. 


Figure 4: Administrative Domain (ADMD) Example 


Crocker Informational [10] 


RFC 5598 Email Architecture July 2009 


Edge networks can use proprietary email standards internally. However, the distinction between Transit 
network and Edge network transfer services is significant because it highlights the need for concern 
over interaction and protection between independent administrations. In particular, this distinction calls 
for additional care in assessing the transitions of responsibility and the accountability and authorization 
relationships among participants in message transfer. 


The interactions of ADMD components are subject to the policies of that domain, which cover concerns 
such as these: 

e Reliability 

e Access control 

e Accountability 

e Content evaluation and modification 


These policies can be implemented in different functional components, according to the needs of the 
ADMD. For example, see [КЕС5068]. 


Consumer, Edge, and Transit services can be offered by providers that operate component services or 
sets of services. Further, it is possible for one ADMD to host services for other ADMDs. 


These are common examples of ADMDs: 


Enterprise Service Providers: These ADMDs operate the internal data and/or the 
mail services within an organization. 


Internet Service Providers (ISP): These ADMDs operate the underlying data 
communication services, which are used by one or 
more Relay and User. ISPs are not responsible for 
performing email functions, but they can provide 
an environment in which those functions can be 
performed. 


Mail Service Providers: These ADMDs operate email services, such as for 
consumers or client companies. 


Practical operational concerns demand that providers be involved in administration and enforcement 
issues. This involvement can extend to operators of lower-level packet services. 
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3. Identities 


The forms of identity used by Internet Mail are: mailbox, domain name, message-ID, and ENVID 
(envelope identifier). Each is globally unique. 


3.1. Mailbox 


"A mailbox receives mail. It is a conceptual entity that does not necessarily pertain to file storage." 
[RFC5322] 


A mailbox is specified as an Internet Mail address <addr#spec>. It has two distinct parts, separated 
by an at#sign (@). The right side is a globally interpreted domain name associated with an ADMD. 
Domain names are discussed in Section 3.3. Formal Internet Mail addressing syntax can support source 
routes to indicate the path through which a message ought to be sent. The use of source routes is not 
common and has been deprecated in [RFC5321 ]. 


The portion to the left of the at#sign contains a string that is globally opaque and is called the 
<local#part>. It is interpreted only by the entity specified by the address's domain name. Except as 
noted later in this section, all other entities treat the <local#part> as an uninterpreted literal string 
and preserve all of its original details. As such, its public distribution is equivalent to sending a Web 
browser "cookie" that is only interpreted upon being returned to its creator. 


Some local#part values have been standardized for contacting personnel at an organization. These 
names cover common operations and business functions [RFC2142]. 


It is common for sites to have local structuring conventions for the left-hand side, <local#part>, of an 
<addr#spec>. This permits sub-addressing, such as for distinguishing different discussion groups used 
by the same participant. However, it is worth stressing that these conventions are strictly private to 
the User's organization and are not interpreted by any domain except the one listed in the right side of 
the <addr#spec>. The exceptions are those specialized services that conform to public, standardized 
conventions, as noted below. 


Basic email addressing defines the <local#part> as being globally opaque. However, there are some 
uses of email that add a standardized, global schema to the value, such as between an Author and 
a Gateway. The <local#part> details remain invisible to the public email transfer infrastructure, but 
provide addressing and handling instructions for further processing by the Gateway. Standardized 
examples of these conventions are the telephone numbering formats for the Voice Profile for Internet 
Mail (VPIM) [RFC3801], such as: 


+16137637582@vpim.example.com, 
and iFax ({[RFC3192], [RFC4143] such as: 


FAX=+12027653000/T33S=1387@ifax.example.com. 


3.2. Scope of Email Address Use 


Email addresses are being used far beyond their original role in email transfer and delivery. In practical 
terms, an email address string has become the common identifier for representing online identity. 
Hence, it is essential to be clear about both the nature and role of an identity string in a particular 
context and the entity responsible for setting that string. For example, see Sections 4.1.4, 4.3.3, and 5. 
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3.3. Domain Names 


A domain name is a global reference to an Internet resource, such as a host, a service, or a network. 
A domain name usually maps to one or more IP Addresses. Conceptually, the name can encompass 
an organization, a collection of machines integrated into a homogeneous service, or a single machine. 
A domain name can be administered to refer to an individual User, but this is not common practice. 
The name is structured as a hierarchical sequence of labels, separated by dots (.), with the top of the 
hierarchy being on the right end of the sequence. There can be many names in the sequence -- that 
is, the depth of the hierarchy can be substantial. Domain names are defined and operated through the 
Domain Name System (DNS) ([RFC1034], [RFC1035], [RFC2181 ]). 


When not part of a mailbox address, a domain name is used in Internet Mail to refer to the ADMD or 
to the host that took action upon the message, such as providing the administrative scope for a message 
identifier or performing transfer processing. 


3.4. Message Identifier 


There are two standardized tags for identifying messages: Message-ID: and ENVID. A Message-ID: 
pertains to content, and an ENVID pertains to transfer. 


3.4.1. Message-ID 


IMF provides for, at most, a single Message-ID:. The Message-ID: for a single message, which is a 
user-level IMF tag, has a variety of uses including threading, aiding identification of duplicates, and 
DSN (Delivery Status Notification) tracking. The Originator assigns the Message-ID:. The Recipient's 
ADMD is the intended consumer of the Message-ID:, although any Actor along the transfer path can 
use it. 


Message-ID: is globally unique. Its format is similar to that of a mailbox, with two distinct parts 
separated by an at#sign (@). Typically, the right side specifies the ADMD or host that assigns the 
identifier, and the left side contains a string that is globally opaque and serves to uniquely identify the 
message within the domain referenced on the right side. The duration of uniqueness for the message 
identifier is undefined. 


When a message is revised in any way, the decision whether to assign a new Message-ID: requires a 
subjective assessment to determine whether the editorial content has been changed enough to constitute 
a new message. [RFC5322] states that "a message identifier pertains to exactly one version of a 
particular message; subsequent revisions to the message each receive new message identifiers." Yet 
experience suggests that some flexibility is needed. An impossible test is whether the Recipient will 
consider the new message to be equivalent to the old one. For most components of Internet Mail, there 
is no way to predict a specific Recipient's preferences on this matter. Both creating and failing to create 
a new Message-ID: have their downsides. 


Here are some guidelines and examples: 


e If a message is changed only in form, such as character encoding, it is still the same message. 


e If a message has minor additions to the content, such as a Mailing List tag at the beginning of the 
RFC5322.Subject header field, or some Mailing List administrative information added to the end 
of the primary body part text, it is probably the same message. 


e If a message has viruses deleted from it, it is probably the same message. 


e If a message has offensive words deleted from it, some Recipients will consider it the same 
message, but some will not. 
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e If a message is translated into a different language, some Recipients will consider it the same 
message, but some will not. 


e Ifa message is included in a digest of messages, the digest constitutes a new message. 
e Ifa message is forwarded by a Recipient, what is forwarded is a new message. 


e Ifa message is "redirected", such as using IMF "Resent-*" header fields, some Recipients will 
consider it the same message, but some will not. 


The absence of both objective, precise criteria for regenerating a Message-ID: and strong protection 
associated with the string means that the presence of an ID can permit an assessment that is marginally 
better than a heuristic, but the ID certainly has no value on its own for strict formal reference or 
comparison. For that reason, the Message-ID: is not intended to be used for any function that has 
security implications. 


3.4.2. ENVID 


The ENVID (envelope identifier) can be used for message-tracking purposes ([RFC3885], [RFC3464 |) 
concerning a single posting/delivery transfer. The ENVID labels a single transit of the MHS by a 
specific message. So, the ENVID is used for one message posting until that message is delivered. A 
re-posting of the message, such as by a Mediator, does not reuse that ENVID, but can use a new one, 
even though the message might legitimately retain its original Message-ID:. 


The format of an ENVID is free form. Although its creator might choose to impose structure on the 
string, none is imposed by Internet standards. By implication, the scope of the string is defined by the 
domain name of the Return Address. 
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4. Services and Standards 


The Internet Mail architecture comprises six basic types of functionality, which are arranged to support 
а store-and-forward service. As shown in Figure 5, each type can have multiple instances, some of 
which represent specialized roles. This section considers the activities and relationships among these 
components, and the Internet Mail standards that apply to them. 


Message 

Message User Agent (MUA) 
Author MUA (aMUA) 
Recipient MUA (rMUA) 

Message Submission Agent (MSA) 
Author-focused MSA functions (aMSA) 
MHS-focused MSA functions (RMSA) 

Message Transfer Agent (MTA) 

Message Delivery Agent (MDA) 
Recipient-focused MDA functions (rMDA) 
MHS-focused MDA functions (ҺМОА) 

Message Store (MS) 

Author MS (aMS) 
Recipient MS (rMS) 


This figure shows function modules and the standardized protocols used between them. 
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MHS 


Figure 5: Protocols and Services 


4.1. Message Data 


The purpose of the Message Handling System (MHS) is to exchange an IMF message object among 
participants [RFC5322]. All of its underlying mechanisms serve to deliver that message from its Author 
to its Recipients. A message can be explicitly labeled as to its nature [RFC3458]. 


A message comprises a transit-handling envelope and the message content. The envelope contains 
information used by the MHS. The content is divided into a structured header and the body. The 
header comprises transit-handling trace information and structured fields that are part of the Author's 
message content. The body can be unstructured lines of text or a tree of multimedia subordinate objects, 
called "body-parts" or, popularly, "attachments". [RFC2045], [RFC2046], [RFC2047], [RFC4288], 
[RFC4289], [RFC2049]. 


In addition, Internet Mail has a few conventions for special control data, notably: 
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Delivery Status Notification (DSN): A Delivery Status Notification (DSN) is a message 
that can be generated by the MHS (MSA, MTA, 
ог MDA) and sent to the RFC5321.MailFrom 
address. MDA and MTA are shown as sources 
of DSNs in Figure 5, and the destination is 
shown as Returns. DSNs provide information 
about message transit, such as transfer errors or 
successful delivery [RFC3461]. 


Message Disposition Notification (MDN): A Message Disposition Notification (MDN) is 
a message that provides information about post- 
delivery processing, such as indicating that the 
message has been displayed [RFC3798] or the 
form of content that can be supported [RFC3297]. 
It can be generated by an rMUA and is sent 
to the Disposition-Notification-To addresses. The 
mailbox for this is shown as Disp in Figure 5. 


Message Filtering (SIEVE): Sieve is a scripting language used to specify 
conditions for differential handling of mail, 
typically at the time of delivery [RFC5228]. 
Scripts can be conveyed in a variety of ways, such 
as a MIME part in a message. Figure 5 shows a 
Sieve script going from the rMUA to the MDA. 
However, filtering can be done at many different 
points along the transit path, and any one or 
more of them might be subject to Sieve directives, 
especially within a single ADMD. Figure 5 shows 
only one relationship, for (relative) simplicity. 


4.1.1. Envelope 


Internet Mail has a fragmented framework for transit-related handling information. Information that is 
used directly by the MHS is called the "envelope". It directs handling activities by the transfer service 
and is carried in transfer-service commands. That is, the envelope exists in the transfer protocol SMTP 
[RFC5321]. 


Trace information, such as RFC5322.Received, is recorded in the message header and is not 
subsequently altered [RFC5322]. 


4.1.2. Header Fields 


Header fields are attribute name/value pairs that cover an extensible range of email-service parameters, 
structured user content, and user transaction meta-information. The core set of header fields is 
defined in [RFC5322]. It is common practice to extend this set for different applications. Procedures 
for registering header fields are defined in [RFC3864]. An extensive set of existing header field 
registrations is provided in [RFC4021 ]. 


One danger of placing additional information in header fields is that Gateways often alter or delete 
them. 
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4.1.3. Body 


The body of a message might be lines of ASCII text or a hierarchically structured composition of 
multimedia body part attachments using MIME ([RFC2045], [RFC2046], [RFC2047], [RFC4288], 


and [RFC2049]). 


4.1.4. Identity References in a Message 


Table 1 lists the core identifiers present in a message during transit. 


Layer Field Set By 
Message Body MIME Header Author 
Message header fields From: Author 
Sender: Originator 
Reply-To: Author 
To:, СС:, BCC: Author 
Message-ID: Originator 
Received: Originator, Relay, Receiver 
Return-Path: MDA, from MailFrom 
Resent-*: Mediator 
List-Id: Mediator 
List-*: Mediator 
SMTP HELO/EHLO Latest Relay Client 
ENVID Originator 
MailFrom Originator 
RcptTo Author 
ORCPT Originator 
IP Source Address Latest Relay Client 


Legend: Layer - The part of the email architecture that uses the identifier; Field - The protocol 
construct that contains the identifier; Set By - The Actor role responsible for specifying the 
identifier value (and this can be different from the Actor that performs the fill-in function for the 
protocol construct). 


Table 1: Layered Identities 


These are the most common address-related fields: 


RFC5322.From: Set by - Author 
Names and addresses for Authors of the message content are listed in the 
From: field. 

RFC5322.Reply-To: Set by - Author 


If a Recipient sends a reply message that would otherwise use the 
RFC5322.From field addresses in the original message, the addresses in 
the RFC5322.Reply-To field are used instead. In other words, this field 
overrides the From: field for responses from Recipients. 


RFC5322.Sender: Set by - Originator 
This field specifies the address responsible for submitting the message to 


the transfer service. This field can be omitted if it contains the same address 
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RFC5322.To/.CC: 


RFC5322.BCC: 


RFC5321.HELO/.EHLO: 


RFC3461.ENVID: 


RFC5321.MailFrom: 


КЕС5321.КсріТо: 
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as RFC5322.From. However, omitting this field does not mean that no 
Sender is specified; it means that that header field is virtual and that the 
address in the From: field is to be used. 

Specification of the notifications Return Addresses, which are contained 
in RFC5321.MailFrom, is made by the RFC5322.Sender. Typically, the 
Return address is the same as the Sender address. However, some usage 
scenarios require it to be different. 


Set by - Author 

These fields specify MUA Recipient addresses. However, some or all of 
the addresses in these fields might not be present in the RFC5321.RcptTo 
commands. 

The distinction between To and CC is subjective. Generally, a To addressee 
is considered primary and is expected to take action on the message. A CC 
addressee typically receives a copy as a courtesy. 


Set by - Author 

A copy of the message might be sent to an addressee whose participation 
is not to be disclosed to the RFC5322.To or RFC5322.CC Recipients and, 
usually, not to the other BCC Recipients. The BCC: header field indicates 
a message copy to such a Recipient. Use of this field is discussed in 
[КЕС5322]. 


Set by - Originator, МЅА, МТА 

Any 5МТР client -- including Originator, MSA, or МТА -- can specify 
its hosting domain identity for the SMTP HELO or EHLO command 
operation. 


Set by - Originator 

The MSA can specify an opaque string, to be included in a DSN, as a means 
of assisting the Return Address Recipient in identifying the message that 
produced a DSN or message tracking. 


Set by - Originator 

This field is an end-to-end string that specifies an email address for 
receiving return control information, such as returned messages. The name 
of this field is misleading, because it is not required to specify either 
the Author or the Actor responsible for submitting the message. Rather, 
the Actor responsible for submission specifies the RFC5321.MailFrom 
address. Ultimately, the simple basis for deciding which address needs to 
be in the RFC5321.MailFrom field is to determine which address is to be 
informed about transfer-level problems (and possibly successes). 


Set by - Author, Final MTA, MDA 

This field specifies the MUA mailbox address of a Recipient. The string 
might not be visible in the message content header. For example, the 
IMF destination address header fields, such as RFC5322.To, might specify 
a Mailing List mailbox, while the RFC5321.RcptTo address specifies а 
member of that list. 
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RFC5321.0RCPT: 


RFC5321.Received: 


RFC5321.Return-Path: 


RFC2919.List-Id: 


RFC2369. List-*: 


RFCO0791.SourceAddr: 
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Set by - Originator. 
This is an optional parameter to the RCPT command, indicating the 
original address to which the current RCPT TO address corresponds, after 
a mapping was performed during transit. An ORCPT is the only reliable 
way to correlate a DSN from a multi-Recipient message transfer with the 
intended Recipient. 


Set by - Originator, Relay, Mediator, Dest 
This field contains trace information, including originating host, Relays, 
Mediators, and MSA host domain names and/or IP Addresses. 


Set by - Originator 
The MDA records the RFC5321.MailFrom address into the 
RFC5321.Return-Path field. 


Set by - Mediator, Author 

This field provides a globally unique Mailing List naming framework that 
is independent of particular hosts [RFC2919]. 

The identifier is in the form of a domain name; however, the string usually 
is constructed by combining the two parts of an email address. The result 
is rarely a true domain name, listed in the domain name service, although 
it can be. 


Set by - Mediator, Author 

[RFC2369] defines a collection of message header fields for use by Mailing 
Lists. In effect, they supply list-specific parameters for common Mailing- 
List user operations. The identifiers for these operations are for the list itself 
and the user-as-subscriber [RFC2369]. 


Set by - The Client SMTP sending host immediately preceding the current 
receiving SMTP server 

[RFC0791] defines the basic unit of data transfer for the Internet: the IP 
datagram. It contains a Source Address field that specifies the IP Address 
for the host (interface) from which the datagram was sent. This information 
is set and provided by the IP layer, which makes it independent of mail- 
level mechanisms. As such, it is often taken to be authoritative, although 
it is possible to provide false addresses. 


Interactions at the user level entail protocol exchanges, distinct from those that occur at lower layers 
of the Internet Mail MHS architecture that is, in turn, above the Internet Transport layer. Because the 
motivation for email, and much of its use, is for interaction among people, the nature and details of these 
protocol exchanges often are determined by the needs of interpersonal and group communication. To 
accommodate the idiosyncratic behavior inherent in such communication, only subjective guidelines, 
rather than strict rules, can be offered for some aspects of system behavior. Mailing Lists provide 
particularly salient examples. 
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4.2.1. Message User Agent (MUA) 


A Message User Agent (MUA) works on behalf of User Actors and User applications. It is their 
representative within the email service. 


The Author MUA (aMUA) creates a message and performs initial submission into the transfer 
infrastructure via a Mail Submission Agent (MSA). It can also perform any creation- and posting-time 
archiving in its Message Store (aMS). An MUA aMS can organize messages in many different ways. A 
common model uses aggregations, called "folders"; in IMAP they are called "mailboxes". This model 
allows a folder for messages under development (Drafts), a folder for messages waiting to be sent 
(Queued or Unsent), and a folder for messages that have been successfully posted for transfer (Sent). 
But none of these folders is required. For example, IMAP allows drafts to be stored in any folder, so 
no Drafts folder needs to be present. 


The Recipient MUA (rMUA) works on behalf of the Recipient to process received mail. This 
processing includes generating user-level disposition control messages, displaying and disposing of 
the received message, and closing or expanding the user-communication loop by initiating replies and 
forwarding new messages. 


NOTE: Although not shown in Figure 5, an MUA itself can have a distributed implementation, such 
as a "thin" user-interface module on a constrained device such as a smartphone, with most of 
the MUA functionality running remotely on a more capable server. An example of such an 
architecture might use IMAP [RFC3501] for most of the interactions between an MUA client 
and an MUA server. An approach for such scenarios is defined by [RFC4550]. 


A Mediator is a special class of MUA. It performs message re#posting, as discussed in Section 2.1. 


An MUA can be automated, on behalf of a User who is not present at the time the MUA is active. 
One example is a bulk sending service that has a timed-initiation feature. These services are not to be 
confused with a Mailing List Mediator, since there is no incoming message triggering the activity of 
the automated service. 


A popular and problematic MUA is an automatic responder, such as one that sends out-of-office 
notices. This behavior might be confused with that of a Mediator, but this MUA is generating a new 
message. Automatic responders can annoy Users of Mailing Lists unless they follow [RFC3834]. 


The identity fields are relevant to a typical MUA: 


RFC5322.From 
RFC5322.Reply-To 
RFC5322.Sender 
RFC5322.To, RFC5322.CC 
RFC5322.BCC 


4.2.2. Message Store (MS) 


An MUA can employ a long-term Message Store (MS). Figure 5 depicts an Author's MS (aMS) and a 
Recipient's MS (rMS). An MS can be located on a remote server or on the same machine as the MUA. 


An MS acquires messages from an MDA either proactively by a local mechanism or even by a 
standardized mechanism such as SMTP(!), ог reactively by using POP or IMAP. The MUA accesses 
the MS either by a local mechanism or by using POP or IMAP. Using POP for individual message 
accesses, rather than for bulk transfer, is relatively rare and inefficient. 
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4.3. MHS-Level Services 


4.3.1. Mail Submission Agent (MSA) 


A Mail Submission Agent (MSA) accepts the message submitted by the aMUA and enforces the 
policies of the hosting ADMD and the requirements of Internet standards. An MSA represents an 
unusual functional dichotomy. It represents the interests of the Author (aMUA) during message 
posting, to facilitate posting success; it also represents the interests of the MHS. In the architecture, 
these responsibilities are modeled, as shown in Figure 5, by dividing the MSA into two sub- 
components, aMSA and hMSA, respectively. Transfer of responsibility for a single message, from an 
Author's environment to the MHS, is called "posting". In Figure 5, it is marked as the (S) transition, 
within the MSA. 


The hMSA takes transit responsibility for a message that conforms to the relevant Internet standards 
and to local site policies. It rejects messages that are not in conformance. The MSA performs final 
message preparation for submission and effects the transfer of responsibility to the MHS, via the 
hMSA. The amount of preparation depends upon the local implementations. Examples of aMSA 
tasks include adding header fields, such as Date: and Message-ID:, and modifying portions of the 
message from local notations to Internet standards, such as expanding an address to its formal IMF 
representation. 


Historically, standards-based MUA/MSA message postings have used SMTP [RFC5321]. The 
standard currently preferred is SUBMISSION [RFC4409]. Although SUBMISSION derives from 
SMTP, it uses a separate TCP port and imposes distinct requirements, such as access authorization. 


These identities are relevant to the MSA: 


RFC5321.HELO/.EHLO 
RFC3461.ENVID 
RFC5321.MailFrom 
RFC5321.RcptTo 
RFC5321.Received 
RFC0791.SourceAddr 


4.3.2. Message Transfer Agent (MTA) 


A Message Transfer Agent (MTA) relays mail for one application-level "hop". It is like a packet switch 
or IP router in that its job is to make routing assessments and to move the message closer to the 
Recipients. Of course, email objects are typically much larger than the payload of a packet or datagram, 
and the end-to-end latencies are typically much higher. Relaying is performed by a sequence of MTAs 
until the message reaches a destination MDA. Hence, an МТА implements both client and server 
MTA functionality; it does not change addresses in the envelope or reformulate the editorial content. A 
change in data form, such as to MIME Content-Transfer-Encoding, is within the purview of an MTA, 
but removal or replacement of body content is not. An MTA also adds trace information [RFC2505]. 


NOTE: Within a destination ADMD, email-relaying modules can make a variety of changes to the 
message, prior to delivery. In such cases, these modules are acting as Gateways, rather than 
MTAs. 


Internet Mail uses SMTP ([RFC5321], [RFC2821], [RFC0821]) primarily to effect point-to-point 
transfers between peer MTAs. Other transfer mechanisms include Batch SMTP [RFC2442] and On- 
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Demand Mail Relay (ODMR) SMTP [RFC2645]. As with most network-layer mechanisms, the 
Internet Mail SMTP supports a basic level of reliability, by virtue of providing for retransmission 
after a temporary transfer failure. Unlike typical packet switches (and Instant Messaging services), 
Internet Mail MTAs are expected to store messages in a manner that allows recovery across service 
interruptions, such as host-system shutdown. The degree of such robustness and persistence by an 
MTA can vary. The base SMTP specification provides a framework for protocol response codes. An 
extensible enhancement to this framework is defined in [RFC5248]. 


Although quite basic, the dominant routing mechanism for Internet Mail is the DNS MX record 
[RFC1035], which specifies an MTA through which the queried domain can be reached. This 
mechanism presumes a public, or at least a common, backbone that permits any attached МТА to 
connect to any other. 


MTAs can perform any of these well-established roles: 


Boundary MTA: An MTA that is part of an ADMD and interacts with MTAs in other ADMDs. This 
is also called a Border MTA. There can be different Boundary MTAs, according to 
the direction of mail-flow. 


Outbound MTA: Ап MTA that relays messages to other ADMDs. 


Inbound MTA: Ап MTA that receives inbound SMTP messages from MTA 
Relays in other ADMDs, for example, an MTA running on the 
host listed as the target of an MX record. 


Final MTA: The MTA that transfers a message to the MDA. 


These identities are relevant to the MTA: 


RFC5321.HELO/.EHLO 
RFC3461.ENVID 

RFC5321.MailFrom 

RFC5321.RcptTo 

RFC5322.Received: Set by - Relay Server 
RFC0791.SourceAddr 


4.3.3. Mail Delivery Agent (MDA) 


A transfer of responsibility from the MHS to a Recipient's environment (mailbox) is called "delivery". 
In the architecture, as depicted in Figure 5, delivery takes place within a Mail Delivery Agent (MDA) 
and is shown as the (D) transition from the MHS-oriented MDA component (hMDA) to the Recipient- 
oriented MDA component (rMDA). 


An MDA can provide distinctive, address-based functionality, made possible by its detailed 
information about the properties of the destination address. This information might also be present 
elsewhere in the Recipient's ADMD, such as at an organizational border (Boundary) Relay. However, 
it is required for the MDA, if only because the MDA is required to know where to deliver the message. 


Like an MSA, an MDA serves two roles, as depicted in Figure 5. Formal transfer of responsibility, 
called "delivery", is effected between the two components that embody these roles and is shown 
as "(D)" in Figure 5. The MHS portion (RMDA) primarily functions as a server SMTP engine. A 
common additional role is to redirect the message to an alternative address, as specified by the Recipient 
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4.4. 


4.5. 


addressee's preferences. The job of the Recipient portion of the MDA (rMDA) is to perform any 
delivery actions that the Recipient specifies. 


Transfer into the MDA is accomplished by a normal MTA transfer mechanism. Transfer from an MDA 
to an MS uses an access protocol, such as POP or IMAP. 


NOTE: The term "delivery" can refer to the formal, MHS function specified here or to the first time 
a message is displayed to a Recipient. A simple, practical test for whether the MHS-based 
definition applies is whether a DSN can be generated. 


These identities are relevant to the MDA: 
RFC5321.Return-Path: Set by - Author Originator or Mediator Originator 


The MDA records the RFC5321.MailFrom address into the 
RFC5321.Return-Path field. 


RFC5322.Received: Set by - MDA server 


An MDA can record a Received: header field to indicate trace 
information, including source host and receiving host domain names 
and/or IP Addresses. 


Transition Modes 


From the origination site to the point of delivery, Internet Mail usually follows a "push" model. That is, 
the Actor that holds the message initiates transfer to the next venue, typically with SMTP [RFC5321] 
or the Local Mail Transfer Protocol (LMTP) [RFC2033]. With a "pull" model, the Actor that holds the 
message waits for the Actor in the next venue to initiate a request for transfer. Standardized mechanisms 
for pull-based MHS transfer are ETRN [RFC1985] and ODMR [КЕС2645]. 


After delivery, the Recipient's MUA (or MS) can gain access by having the message pushed to it 
or by having the receiver of access pull the message, such as by using POP [RFC1939] and IMAP 
[RFC3501]. 


Implementation and Operation 


A discussion of any interesting system architecture often bogs down when architecture and 
implementation are confused. An architecture defines the conceptual functions of a service, divided 
into discrete conceptual modules. An implementation of that architecture can combine or separate 
architectural components, as needed for a particular operational environment. For example, a software 
system that primarily performs message relaying is an MTA, yet it might also include MDA 
functionality. That same MTA system might be able to interface with non-Internet email services and 
thus perform both as an MTA and as a Gateway. 


Similarly, implemented modules might be configured to form elaborations of the architecture. An 
interesting example is a distributed MS. One portion might be a remote server and another might be 
local to the MUA. As discussed in [RFC1733], there are three operational relationships among such 
MSs: 
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Online: 


Offline: 


Disconnected: 
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The MS is remote, and messages are accessible only when the MUA is attached 
to the MS so that the MUA will re-fetch all or part of a message from one session 
to the next. 


The MS is local to the User, and messages are completely moved from any remote 
store, rather than (also) being retained there. 


An rMS and a uMS are kept synchronized, for all or part of their contents, while 
they are connected. When they are disconnected, mail can arrive at the rMS and 
the User can make changes to the uMS. The two stores are re-synchronized when 
they are reconnected. 
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5. Mediators 


Basic message transfer from Author to Recipients is accomplished by using an asynchronous store- 
and-forward communication infrastructure in a sequence of independent transmissions through some 
number of MTAs. A very different task is a sequence of postings and deliveries through Mediators. A 
Mediator forwards a message through а re#posting process. The Mediator shares some functionality 
with basic MTA relaying, but has greater flexibility in both addressing and content than is available 
to MTAs. 


This is the core set of message information that is commonly set by all types of Mediators: 


RFC5321.HELO/.EHLO: Set by - Mediator Originator 
RFC3461.ENVID: Set by - Mediator Originator 
КЕС5321.КсріТо: Set by - Mediator Author 
RFC5321.Received: Set by - Mediator Dest 


The Mediator can record received information to indicate the delivery 
to the original address and submission to the alias address. The trace of 
Received: header fields can include everything from original posting, 
through relaying, to final delivery. 


The aspect of a Mediator that distinguishes it from any other MUA creating a message is that a Mediator 
preserves the integrity and tone of the original message, including the essential aspects of its origination 
information. The Mediator might also add commentary. 


Examples of MUA messages that a Mediator does not create include: 
New message that forwards an existing message: 


Although this action provides a basic template for a class of Mediators, its typical occurrence 
is not, itself, an example of a Mediator. The new message is viewed as being from the Actor 
that is doing the forwarding, rather than from the original Author. 
A new message encapsulates the original message and is seen as from the new Originator. 
This Mediator Originator might add commentary and can modify the original message content. 
Because the forwarded message is a component of the message sent by the new Originator, 
the new message creates a new dialogue. However, the final Recipient still sees the contained 
message as from the original Author. 

Reply: 
When a Recipient responds to the Author of a message, the new message is nottypically viewed 
as a forwarding of the original. Its focus is the new content, although it might contain all or 
part of the material from the original message. The earlier material is merely contextual and 
secondary. This includes automated replies, such as vacation out-of-office notices, as discussed 
in Section 4.2.1. 


Annotation: 


The integrity of the original message is usually preserved, but one or more comments about the 
message are added in a manner that distinguishes commentary from original text. The primary 
purpose of the new message is to provide commentary from a new Author, similar to a Reply. 


The remainder of this section describes common examples of Mediators. 
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5.1. Alias 


One function of an МРА is to determine the internal location of a mailbox in order to perform delivery. 
An Alias is a simple re#addressing facility that provides one or more new Internet Mail addresses, 
rather than a single, internal one; the message continues through the transfer service, for delivery to 
one or more alternate addresses. Although typically implemented as part of an MDA, this facility is 
a Recipient function. It resubmits the message, although all handling information except the envelope 
Recipient (rfc5321.RcptTo) address is retained. In particular, the Return Address (rfc5321.MailFrom) 
is unchanged. 


What is distinctive about this forwarding mechanism is how closely it resembles normal MTA store- 
and-forward relaying. Its only significant difference is that it changes the RFC5321.RcptTo value. 
Because this change is so small, aliasing can be viewed as a part of the lower-level mail-relaying 
activity. However, this small change has a large semantic impact: The designated Recipient has chosen 
a new Recipient. 


NOTE: When the replacement list includes more than one address, the alias is increasingly likely to 
have delivery problems. Any problem reports go to the original Author, not the administrator 
of the alias entry. This makes it more difficult to resolve the problem, because the original 
Author has no knowledge of the Alias mechanism. 


Including the core set of message information listed at the beginning of this section, Alias typically 
changes: 


RFC5322.To/.CC/.BCC: Set by - Author 
These fields retain their original addresses. 


RFC5321.MailFrom: Set by - Author 

The benefit of retaining the original MailFrom value is to ensure that 
an Actor related to the originating ADMD knows there has been a 
delivery problem. On the other hand, the responsibility for handling 
problems, when transiting from the original Recipient mailbox to the 
alias mailbox usually lies with that original Recipient, because the 
Alias mechanism is strictly under that Recipient's control. Retaining 
the original MailFrom address prevents this. 


5.2. ReSender 


Also called the ReDirector, the ReSender's actions differ from forwarding because the Mediator 
"splices" a message's addressing information to connect the Author of the original message with the 
Recipient of the new message. This connection permits them to have direct exchange, using their 
normal MUA Reply functions, while also recording full reference information about the Recipient who 
served as a Mediator. Hence, the new Recipient sees the message as being from the original Author, 
even if the Mediator adds commentary. 


Including the core set of message information listed at the beginning of this section, these identities 
are relevant to a resent message: 


RFC5322.From: Set by - original Author 
Names and addresses for the original Author of the message 
content are retained. The free-form (display-name) portion 
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RFC5322.Reply-To: 


RFC5322.Sender: 


КЕС5322.То/.СС/ ВСС: 


RFC5322.Resent-From: 


RFC5322.Resent-Sender: 


RFC5322.Resent-To/-CC/-BCC: 


RFC5321.MailFrom: 


5.3. Mailing Lists 
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of the address might be modified to provide an informal 
reference to the ReSender. 


Set by - original Author 
If this field is present in the original message, it is retained in 
the resent message. 


Set by - Author's Originator or Mediator Originator 


Set by - original Author 
These fields specify the original message Recipients. 


Set by - Mediator Author 

This address is of the original Recipient who is redirecting the 
message. Otherwise, the same rules apply to the Resent-From: 
field as to an original RFC5322.From field. 


Set by - Mediator Originator 

The address of the Actor responsible for resubmitting the 
message. As with RFC5322.Sender, this field can be omitted 
when it contains the same address as RFC5322.Resent-From. 


Set by - Mediator Author 
The addresses of the new Recipients who are now able to reply 
to the original Author. 


Set by - Mediator Originator 

The Actor responsible for resubmission (RFC5322.Resent- 
Sender) is also responsible for specifying the new MailFrom 
address. 


A Mailing List receives messages as an explicit addressee and then re-posts them to a list of subscribed 
members. The Mailing List performs a task that can be viewed as an elaboration of the ReSender. 
In addition to sending the new message to a potentially large number of new Recipients, the Mailing 
List can modify content, for example, by deleting attachments, converting the format, and adding list- 
specific comments. Mailing Lists also archive messages posted by Authors. Still the message retains 
characteristics of being from the original Author. 


Including the core set of message information listed at the beginning of this section, these identities 
are relevant to a Mailing List processor when submitting a message: 


RFC2919.List-Id: Set by - Mediator Author 
RFC2369. List-*: Set by - Mediator Author 
RFC5322.From: Set by - original Author 


Names and email addresses for the original Author of the message content 


are retained. 


RFC5322.Reply-To: | Set by - Mediator or original Author 
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Although problematic, it is common for a Mailing List to assign its own 
addresses to the Reply-To: header field of messages that it posts. This 
assignment is intended to ensure that replies go to all list members, rather 
than to only the original Author. As a User Actor, a Mailing List is the 
Author of the new message and can legitimately set the Reply-To: value. 
As a Mediator attempting to represent the message on behalf of its original 
Author, creating or modifying a Reply-To: field can be viewed as violating 
that Author's intent. When the Reply-To is modified in this way, a reply 
that is meant only for the original Author will instead go to the entire list. 
When the Mailing List does not set the field, a reply meant for the entire 
list can instead go only to the original Author. At best, either choice is a 
matter of group culture for the particular list. 


RFC5322.Sender: Set by - Author Originator or Mediator Originator 
This field usually specifies the address of the Actor responsible for Mailing 
List operations. Mailing Lists that operate in a manner similar to a simple 
MTA Relay preserve as much of the original handling information as 
possible, including the original RFC5322.Sender field. (Note that this 
mode of operation causes the Mailing List to behave much like an Alias, 
with a possible difference in number of new addressees.) 


RFC5322.To/.CC: Set by - original Author 
These fields usually contain the original list of Recipient addresses. 


RFC5321.MailFrom: Set by - Mediator Originator 
Because a Mailing List can modify the content of a message in any way, 
it is responsible for that content; that is, it is an Author. As such, the 
Return Address is specified by the Mailing List. Although it is plausible 
for the Mailing List to reuse the Return Address employed by the original 
Originator, notifications sent to that address after a message has been 
processed by a Mailing List could be problematic. 


5.4. Gateways 


A Gateway performs the basic routing and transfer work of message relaying, but it also is permitted 
to modify content, structure, address, or attributes as needed to send the message into a messaging 
environment that operates under different standards or potentially incompatible policies. When a 
Gateway connects two differing messaging services, its role is easy to identify and understand. 
When it connects environments that follow similar technical standards, but significantly different 
administrative policies, it is easy to view a Gateway as merely an MTA. 


The critical distinction between an MTA and a Gateway is that a Gateway can make substantive 
changes to a message to map between the standards. In virtually all cases, this mapping results in 
some degree of semantic loss. The challenge of Gateway design is to minimize this loss. Standardized 
Gateways to Internet Mail are facsimile [RFC4143], voicemail [RFC3801], and the Multimedia 
Messaging Service (MMS) [RFC4356]. 


A Gateway can set any identity field available to an MUA. Including the core set of message 
information listed at the beginning of this section, these identities are typically relevant to Gateways: 


RFC5322.From: Set by - original Author 
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RFC5322.Reply-To: 


RFC5322.Sender: 


КЕС5322.То/.СС/. ВСС: 


RFC5321.MailFrom: 


5.5. Boundary Filter 
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Names and addresses for the original Author of the message content 
are retained. As for all original addressing information in the message, 
the Gateway can translate addresses as required to continue to be useful 
in the target environment. 


Set by - original Author 

It is best for a Gateway to retain this information, if it is present. The 
ability to perform a successful reply by a Recipient is a typical test of 
Gateway functionality. 


Set by - Author Originator or Mediator Originator 
This field can retain the original value or can be set to a new address. 


Set by - original Recipient 
These fields usually retain their original addresses. 


Set by - Author Originator or Mediator Originator 
The Actor responsible for handling the message can specify a new 
address to receive handling notices. 


To enforce security boundaries, organizations can subject messages to analysis for conformance with 
its safety policies. An example is detection of content classed as spam or a virus. A filter might alter the 
content to render it safe, such as by removing content deemed unacceptable. Typically, these actions 
add content to the message that records the actions. 
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6. Considerations 


6.1. 


6.2. 


Security Considerations 


This document describes the existing Internet Mail architecture. It introduces no new capabilities. 
The security considerations of this deployed architecture are documented extensively in the technical 
specifications referenced by this document. These specifications cover classic security topics, such as 
authentication and privacy. For example, email-transfer protocols can use standardized mechanisms 
for operation over authenticated and/or encrypted links, and message content has similar protection 
standards available. Examples of such mechanisms include SMTP-TLS [RFC3207], SMTP-Auth 
[RFC4954], OpenPGP [RFC4880], and S/MIME [RFC3851]. 


The core of the Internet Mail architecture does not impose any security requirements or functions on 
the end-to-end or hop-by-hop components. For example, it does not require participant authentication 
and does not attempt to prevent data disclosure. 


Particular message attributes might expose specific security considerations. For example, the blind 
carbon copy feature of the architecture invites disclosure concerns, as discussed in Section 7.2 of 
[RFC5321] and Section 5 of [RFC5322]. Transport of text or non-text content in this architecture has 
security considerations that are discussed in [RFC5322], [RFC2045], [RFC2046], and [RFC4288]; 
also, security considerations are present for some of the media types registered with IANA. 


Agents that automatically respond to email raise significant security considerations, as discussed 
in [RFC3834]. Gateway behaviors affect end-to-end security services, as discussed in [RFC2480]. 
Security considerations for boundary filters are discussed in [RFC5228]. 


See Section 7.1 of [RFC5321] for a discussion of the topic of origination validation. As 
mentioned in Section 4.1.4, it is common practice for components of this architecture to use the 
RFC0791.SourceAddr to make policy decisions [RFC2505], although the address can be "spoofed". 
It is possible to use it without authorization. SMTP and Submission authentication ([КЕС4409], 
[RFC4954]) provide more secure alternatives. 


The discussion of trust boundaries, ADMDs, Actors, roles, and responsibilities in this document 
highlights the relevance and potential complexity of security factors for operation of an Internet Mail 
service. The core design of Internet Mail to encourage open and casual exchange of messages has 
met with scaling challenges, as the population of email participants has grown to include those with 
problematic practices. For example, spam, as defined in [RFC2505], is a by-product of this architecture. 
A number of Standards Track or BCP documents on the subject have been issued (see [RFC2505], 
[RFC5068], and [RFC5235]). 


Internationalization 


The core Internet email standards are based on the use of US-ASCII -- that 15, SMTP [RFC5321] 

and IMF [RFC5322], as well as their predecessors. They describe the transport and composition 

of messages as composed strictly of US-ASCII 7-bit encoded characters. The standards have been 
incrementally enhanced to allow for characters outside of this limited set, while retaining mechanisms 
for backwards-compatibility. Specifically: 

e Тһе MIME specifications ([RFC2045], [RFC2046], [RFC2047], [RFC2049]) allow for the use of 
coded character sets and character-encoding schemes ("charsets" in MIME terminology) other than 
US-ASCII. MIME's [RFC2046] allows the textual content of a message to have a label affixed 
that specifies the charset used in that content. Equally, MIME's [RFC2047] allows the textual 
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content of certain header fields in a message to be similarly labeled. However, since messages 
might be transported over SMTP implementations only capable of transporting 7-bit encoded 
characters, MIME's [RFC2045] also provides for "content transfer encoding" so that characters of 
other charsets can be re-encoded as an overlay to US-ASCII. 


MIME's [RFC2045] allows for the textual content of a message to be in an 8-bit character-encoding 
scheme. In order to transport these without re-encoding them, the SMTP specification supports 
an option [RFC1652] that permits the transport of such textual content. However, the [RFC1652] 
option does not address the use of 8-bit content in message header fields, and therefore [RFC2047] 
encoding is still required for those. 


A series of experimental protocols on Email Address Internationalization (EAT) have been released 
that extend SMTP and IMF to allow for 8-bit encoded characters to appear in addresses and other 
information throughout the header fields of messages. [RFC5335] specifies the format of such 
message header fields (which encode the characters in UTF-8), and [RFC5336] specifies an SMTP 
option for the transport of these messages. 

MIME's [RFC2045] and [RFC2046] allow for the transport of true multimedia material; such 
material enables internationalization because it is not restricted to any particular language or locale. 
The formats for Delivery Status Notifications (DSNs — [RFC3462], [RFC3463], [RFC3464]) 
and Message Disposition Notifications (MDNs — [RFC3798]) include both a structured and 
unstructured representation of the notification. In the event that the unstructured representation is 
in the wrong language or is otherwise unsuitable for use, this allows an MUA to construct its own 
appropriately localized representation of notification for display to the User. 

POP and IMAP have no difficulties with handling MIME messages, including ones containing 
Sbit, and therefore are not a source of internationalization issues. 


Hence, the use of UTF-8 is fully established in existing Internet Mail. However, support for long- 
standing encoding forms is retained and is still used. 
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